An English-persian Automatic Speech Translator: Recent Developments in Domain Portability and User Modeling
نویسندگان
چکیده
In this paper we describe the English-Persian speech to speech translation device, and provide insight on the lessons learned during the first phase of development of this system. We start by giving an overview of the underlying components of the device: the front end ASR, the machine translation system and the speech generation system. Challenges such as the sparseness of available spoken language data and solutions that have been employed to maximize the obtained benefits from using these limited resources are described. In addition, we present results on system evaluation, which identify that a large degree of the system errors can not be attributed to machine performance, but rather to user operation and process losses. The identification of this large margin of improvement led to the initiation of a new research field: user modeling in cross-lingual mediated communication, and we describe our first results in user type classification.
منابع مشابه
The Transonics Spoken Dialogue Translator: An aid for English-Persian Doctor-Patient interviews
In this paper we describe our spoken english-persian medical dialogue translation system. We describe the data collection effort and give an overview of the component technologies, including speech recognition, translation, dialogue management, and user interface design. The individual modules and system are designed for flexibility, and to be able to leverage different amounts of available res...
متن کاملA Database for Automatic Persian Speech Emotion Recognition: Collection, Processing and Evaluation
Abstract Recent developments in robotics automation have motivated researchers to improve the efficiency of interactive systems by making a natural man-machine interaction. Since speech is the most popular method of communication, recognizing human emotions from speech signal becomes a challenging research topic known as Speech Emotion Recognition (SER). In this study, we propose a Persian em...
متن کاملAutomatic Translation between English and Persian Texts
PEnTrans is an automatic bidirectional English/ Persian text translator. It contains two main modules, PEnT1,2. PEnT1 translates English sentences into Persian and PEnT2 performs translation from Persian to English. WSD which is an important part in translation is done in both systems by employing a combination of extended dictionary and corpus based approaches in PEnT1 and employing a combinat...
متن کاملTranslation Methodology in the Spoken Language Translator: An Evaluation
In this paper we describe how the translation methodology adopted for the Spoken Language Translator (SLT) addresses the characteristics of the speech translation task in a context where it is essential to achieve easy customization to new languages and new domains. We then discuss the issues that arise in any attempt to evaluate a speech translator, and present the results of such an evaluatio...
متن کاملUsing Corpora to Develop Limited-Domain Speech Translation Systems
The paper describes the Spoken Language Translator (SLT) system, a prototype automatic speech translator. SLT is currently capable of translating spoken English queries in the domain of air travel planning into either Swedish or French, using a vocabulary of about 1200 words. We present an overview of the system's architecture, concentrating on how rationally constructed balanced corpora are us...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2006